In many real-world data analysis tasks, much more useful knowledge can be expected from utilizing multiple databases stored in different organizations, such as cooperation groups, state organs, and allied countries. However, such organizations often hesitate to publish their databases because of privacy and security concerns, even though they recognize the advantages of collaborative analysis. This paper proposes a novel collaborative framework for utilizing vertically partitioned cooccurrence matrices in fuzzy co-cluster structure estimation, in which cooccurrence information among objects and items is stored separately at several sites. To utilize such distributed data sets without fear of information leaks, a privacy-preserving procedure is introduced into fuzzy clustering for categorical multivariate data (FCCM). Each element of the cooccurrence matrices is withheld; only object memberships are shared among the sites, and their (implicit) joint co-cluster structures are revealed through an iterative clustering process. Several experimental results demonstrate that collaborative analysis reveals the global intrinsic co-cluster structures of the separate matrices better than individual site-wise analysis does. The novel framework makes it possible for many private and public organizations to share common structural knowledge of their data without fear of information leaks.
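As a rough illustration of the iterative process outlined above, the following Python sketch assumes the standard entropy-regularized FCCM update rules and assumes that item memberships are normalized within each site. The function names (`local_item_update`, `local_object_scores`, `shared_object_update`, `collaborative_fccm`), the temperature parameters `t_u` and `t_w`, and the toy matrices are hypothetical. The plain summation of per-site scores merely stands in for the privacy-preserving procedure, whose details are not given here, so this should not be read as the paper's actual protocol.

```python
import math
import random


def local_item_update(R, U, t_w):
    """Site-local step: w[c][j] proportional to exp(sum_i u[c][i] * r[i][j] / t_w).

    Assumption: item memberships are normalized over the items of this site only.
    """
    n, m = len(R), len(R[0])
    W = []
    for u_c in U:
        scores = [sum(u_c[i] * R[i][j] for i in range(n)) / t_w for j in range(m)]
        top = max(scores)  # subtract the maximum for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        W.append([e / total for e in exps])
    return W


def local_object_scores(R, W):
    """Site-local step: partial scores s[c][i] = sum_j w[c][j] * r[i][j]."""
    return [[sum(w * r for w, r in zip(w_c, row)) for row in R] for w_c in W]


def shared_object_update(partial_scores, t_u):
    """Shared step: add up the per-site scores and normalize over clusters.

    Assumption: a plain sum replaces the privacy-preserving aggregation.
    """
    n_clusters = len(partial_scores[0])
    n = len(partial_scores[0][0])
    U = [[0.0] * n for _ in range(n_clusters)]
    for i in range(n):
        totals = [sum(s[c][i] for s in partial_scores) / t_u for c in range(n_clusters)]
        top = max(totals)
        exps = [math.exp(v - top) for v in totals]
        z = sum(exps)
        for c in range(n_clusters):
            U[c][i] = exps[c] / z
    return U


def collaborative_fccm(sites, n_clusters, t_u=0.1, t_w=1.0, n_iter=50, seed=0):
    """Alternate site-local item updates and the shared object update."""
    rng = random.Random(seed)
    n = len(sites[0])  # every site stores the same n objects
    # Random initial object memberships, normalized so each object sums to 1.
    U = [[rng.random() for _ in range(n)] for _ in range(n_clusters)]
    for i in range(n):
        z = sum(U[c][i] for c in range(n_clusters))
        for c in range(n_clusters):
            U[c][i] /= z
    W_sites = []
    for _ in range(n_iter):
        W_sites = [local_item_update(R, U, t_w) for R in sites]
        partial = [local_object_scores(R, W) for R, W in zip(sites, W_sites)]
        U = shared_object_update(partial, t_u)
    return U, W_sites


# Hypothetical toy data: two sites hold different item columns for the same 4 objects.
site_a = [[3, 2, 0], [3, 2, 0], [0, 0, 4], [0, 1, 3]]
site_b = [[1, 0], [1, 0], [0, 2], [0, 2]]
U, W_sites = collaborative_fccm([site_a, site_b], n_clusters=2)
```

In this sketch, the cooccurrence elements are read only inside the two `local_*` functions; the shared step sees just per-object cluster scores and returns the object memberships, which is the only quantity every site receives back.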